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ABSTRACT 


The nuclear gene xanthine dehydrogenase (Xdh) was sequenced for 247 genera representing all major lineages of angiosperms 
and “gymnosperms,” and the results were analyzed using likelihood and parsimony methods. The overall topology is mostly 
congruent with previously published trees based on chloroplast rbcL, atpB, and matK sequences. A total of 190 of the 253 nodes 
(71%) of the Xdh tree received bootstrap support greater than 50%. The likelihood tree was comparable in robustness to the 
matK topology, which exhibited 79% of the nodes with bootstrap support greater than 50%, and to the reported 7%-24% 
support observed for individual analyses of rbcL, atpB, and 18S ribosomal DNA clades. The number of parsimony-informative 
sites (1068, 69%) was similar to that of the matK (1083, 62%) study. The likelihood tree depicts angiosperms as monophyletic, 
with Ceratophyllum L. (Ceratophyllaceae) as sister to the rest of the flowering plants, followed successively by Amborellaceae, 
Nymphaeaceae, and Austrobaileyales clades as sisters to the remaining angiosperms. Acorus L. plus the remaining monocots, 
magnoliids, and Chloranthaceae diverge after the Austrobaileyales. Eudicots are supported and include a basal grade of 
Ranunculales-Proteaceae, Sabiaceae, Trochodendraceae, Buxaceae, Gunneraceae, and Dilleniaceae-Santalaceae, which are 
subsequent sister to the remaining eudicots. The remaining eudicots are split into two clades. The first clade consists of the 
Ericales, Cornales, and euasterids I and II (lamids-campanulids). The second clade consists of the following orders: 
Saxifragales, Myrtales-Caryophyllales-Cucurbitales, Crossosomatales, Geraniales, Rosales-Fabales-Fagales, Celastrales, 
Malpighiales, Brassicales-Malvales, Oxalidales, and Sapindales. Xdh data provided good support in the Caryophyllales, 
Ericales and Cornales, euasterids I (lamids), Magnoliales and Laurales, Malvales, Rutaceae, Oxalidales, Brassicales, and 
Sapindales. A future combined analysis of Xdh and other DNA data sets will have a strong potential to enhance resolution and 
internal support for angiosperm phylogenetics and provide insights into angiosperm evolution using biparental information. 
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DNA-based phylogenetic analyses of angiosperms 
have made use of a number of chloroplast markers 
over the past 15 years (Chase et al., 1993; 
Angiosperm Phylogeny Group, 1998; Soltis et al., 
1999, 2000, 2003, 2005, 2007b; Savolainen et al., 
2000; Angiosperm Phylogeny Group II, 2003; Hilu et 
al., 2003; Jansen et al., 2007). More recently, the 
basal angiosperms have been examined using 
chloroplast, nuclear, and mitochondrial markers 
(Qiu et al., 2006). Patterns have emerged regarding 
relationships among the angiosperms: (1) Amborella 
Baill. or Amborella-Nymphaeales represents the 
earliest diverging lineage of extant angiosperms, with 
Austrobaileyales the next clade to diverge; (2) 
Ceratophyllum L. is sister to either the monocots or 
eudicots; (3) magnoliids are composed of four orders 
(Canellales, Laurales, Magnoliales, and Piperales); 
(4) the monocots, a well-defined group; and (5) the 
Ranunculales, usually the first lineage in the 
eudicots, with Buxaceae, Caryophyllales, Cornales, 
Dilleniaceae, Ericales, euasterids, Gunnerales, Pro- 


teaceae, rosids, Sabiaceae, Santalales, and Trocho- 
dendraceae following in various arrangements and 
forming well-supported clades (Angiosperm Phylog- 
eny Group II, 2003; Hilu et al., 2003; Soltis et al., 
2005). Not all lineages are stable in their positions on 
individual or combined gene trees, and many have 
weak support. Major clades within basal angiosperms 
that have varied in position include the following: 
Amborella, Austrobaileyales, Ceratophyllum, magno- 
liids, monocots, and Nymphaeaceae (Angiosperm 
Phylogeny Group II, 2003; Hilu et al., 2003; Soltis 
et al., 2003; Qiu et al., 2006; Moore et al., 2007). 
Moreover, the relationships among the major clades 
of eudicots (Caryophyllales, Cornales, Dilleniaceae, 
Ericales, euasterids, Proteaceae, rosids, Sabiaceae, 
and Santalales) remain uncertain (Angiosperm Phy- 
logeny Group II, 2003; Hilu et al., 2003; Soltis et al., 
2003; Qiu et al., 2006). Resolving these relationships 
is essential for systematics and for tracing character 
evolution. Furthermore, many other fields of study 
rely on a well-supported angiosperm phylogeny, so it 
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is crucial that phylogenetics produce a well-resolved 
topology using molecular and nonmolecular features. 
Studies of broad angiosperm phylogenetics have 
centered on using chloroplast genes individually and 
in combination (Chase et al., 1993; Soltis et al., 1997, 
2000; Savolainen et al., 2000; Hilu et al., 2003; 
Jansen et al., 2007). More recently, mitochondrial 
genes have been incorporated into basal angiosperm 
phylogenies (rpoC2, Qiu et al., 2006). Nuclear genes 
have hardly been used, and only the 18S gene has 
been extensively incorporated into multigene analy- 
ses investigating angiosperm phylogeny (Soltis et al., 
1997). On its own, 18S has provided very limited 
resolution and low internal support for many major 
clades (Soltis et al., 1997). Studies of rbcL and atpB 
chloroplast genes have provided phylogenies with 
more resolution than individual genes; however, 
topological incongruence does occur between the 
genes (Savolainen et al., 2000). Analysis of the 
chloroplast gene matK provided a tree with greater 
resolution and more support for major clades, but 
topological incongruences were observed among 
matK, rbcL, and atpB. Combining chloroplast genes 
in multigene data sets for analysis has improved both 
resolution and internal support (Savolainen et al., 
2000; Burleigh et al., 2009). Hilu et al. (2003) 
compared bootstrap/jackknife values for major nodes 
within the angiosperm trees and found the following 
supported percentages for these genes: rbcL (24%), 
atpB (14%), 18S (7%), rbcLlatpB (55%), and rbcL/ 
alpB/18S (83%). When using chloroplast genes, one 
must be a bit cautious because these genes are 
maternally inherited and, due to issues such as 
hybridization, the resulting phylogenetic reconstruc- 
tions may not reflect the “true phylogeny” or correct 
topology. The first angiosperm consensus tree based 
on combined chloroplast (rbcL and atpB) and nuclear 
(18S) genes was conducted by Soltis et al. (2000). 
Although 18S is a very slowly evolving gene, the 
combined chloroplast and nuclear consensus tree 
may be considered the currently most reliable overall 
angiosperm phylogeny because it is based on both 
maternally and biparentally inherited genes. Howev- 
er, many of the interrelationships in the angiosperm 
and land plant phylogenies are still controversial, and 
it is imperative to use information from different 
biparentally inherited genes to clarify and enhance 
the results derived largely from chloroplast genes. 
Xanthine dehydrogenase is a member of the 
molybdenum hydroxylase family of enzymes, which 
are thought to play important metabolic roles in 
purine metabolism and hormone biosynthesis. Mo- 
lybdenum (Mo) is essential for (nearly) all organisms 
and functions in more than 40 enzymes by catalyzing 
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diverse redox reactions. Only four of those enzymes 
have been found in plants: nitrate reductase; 
aldehyde oxidase(s); xanthine dehydrogenase, which 
is involved in purine catabolism and stress reactions; 
and sulphite oxidase (Mendel & Hänsch, 2002). 
Among Mo enzymes, the alignment of amino acid 
sequences permits the definition of domains that are 
well conserved, although they are variable. This study 
amplified approximately 1265 bp of the xanthine 
dehydrogenase gene (Xdh), which is located on 
chromosome 4, contig fragment no. 82, and is 
approximately 4083 bp long (information obtained 
by comparing published Arabidopsis and this study's 
sequences). The sequenced segment lies from 358 to 
1623 bp including intron 1-3 and exon 2-4. The 
Arabidopsis (DC.) Heynh. sequence contains 13 
introns and 14 exons. Hesberg et al. (2004) found 
that Arabidopsis has a 46%-47% identity to human 
and bovine Xdh and therefore is homologous to Xdh 
from other organisms. This examination. segment 
contains approximately 11 indel regions. To date, 
only one copy of the gene has been reported, except 
in silkworms and Arabidopsis, in which a second copy 
has been found. In Arabidopsis, Hesberg et al. (2004) 
reported that the physicochemical properties of 
AiXDHI and AtXDH2 might be nearly identical 
because of their high degree of sequence similarity 
(94%). 

Markers from the nuclear genome that can be used 
on a broad taxonomic scale without problems of 
paralogy are limited in number. The possibility of 
paralogy exists for all genes, not just nuclear ones, 
because gain and loss of genes by various processes 
are continuous and all genes have some probability of 
being duplicated or lost. One consequence of 
paralogy is that gene trees derived from samples of 
single genes from each species may not match the 
species tree. Having more than one gene tree to 
compare the angiosperm phylogeny is a good 
indicator of having paralogous genes. We have used 
this test of congruence among genes to evaluate 
duplications or losses within Xdh. 

This analysis provides the first angiosperm plant 
gene tree using Xdh. More recently other researchers 


ee 


have been exploring the utility of Xdh for “gymno- 
sperm" phylogenetics and have found it to be similar 
to inferences using other genes and to be highly 
informative (Peery et al., 2008). 

We compared the topology obtained from this gene 
to previously published topologies from rbcL, atpB, 
and matK genes. This assessment was done to 
achieve two major goals: (1) compare maternally 
inherited genes to a biparental gene to see if the 
topologies are more, equally, or less resolved, 
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supported, and congruent, and (2) examine and 
compare the topologies for basal angiosperms and 
eudicots among the data sets for a greater under- 
standing of seed plant relationships. 


MATERIALS AND METHODS 


TAXON SAMPLING 


Several different publications were consulted for 
taxon sampling schemes, including Savolainen et al. 
(2000), Soltis et al. (2000), Hilu et al. (2003), and 
Qiu et al. (2006). In total, 253 species representing 
164 families and 247 genera of angiosperms and 
gymnosperms were sampled, and all but one was 
sequenced specifically for this study. See Appendix 1 
for a list of taxa with accession data and GenBank 
numbers. 


DNA ISOLATION, GENE AMPLIFICATION, AND SEQUENCING 


For most samples, total DNA was isolated from 
fresh or herbarium specimens, although some was 
acquired from previously extracted samples. Leaves 
were ground with a mortar and pestle and subse- 
quently extracted using the DNEasy plant DNA 
extraction kit (Qiagen, Valencia, California, U.S.A.) 
following the manufacturer’s protocol. Specific poly- 
merase chain reaction (PCR) and sequencing primers 
were initially designed using the Arabidopsis plant 
sequences and other organisms available in GenBank 
and used by Rodriguez-Trelles et al. (2003). 
Additional primers were redesigned as needed. See 
Table 1 for a list of primers used for amplification and 
sequencing. Cloning was performed on selected taxa 
using the pGem T-easy vector system II kit (Promega 
Corporation, Madison, Wisconsin, U.S.A.) following 
the manufacturer’s protocol. PCR was performed with 
10 umol/L primers in 25-uL reactions. In general, it 
was conducted using the following program: 96°C for 
1 min. followed by 34 cycles of 94°C for 30—60 sec., 
48^—60?C for 1 min., 68°C for 60—80 sec., and finally 
72°C for 7 min. For some taxa, the initial 96°C for 1 
min. and the final 72°C for 7 min. were eliminated. 
Primers used for amplification of clones were MI3F 
and M13R. For sequencing reactions, the Big Dye 
Terminator Sequencing 3.1 kit (Applied Biosystems, 
Foster City, California, U.S.A.) was used, and 
fragments were separated on an ABI3730 DNA 
Analyzer from Applied Biosystems. 


PHYLOGENETIC ANALYSIS 


The Xdh gene was analyzed using both maximum 
parsimony (MP) and maximum likelihood (ML). For 
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Table 1. 
sequences. The Xdh gene was split into two halves for 


Primers used for the amplification along with their 


amplification (forward and reverse). All taxa used in this study 
were amplified with a combination of the primers listed below, 
except for primer 1299-GF, which was designed to amplify the 
back half in gymnosperm taxa. 


Primer name Sequence 


*A471F 5'-CAATGTGCRTTTGTY ACY CCYGG-3’ 
*486F 5'-ACYCCYGGKTTTRTIATGTCIATGTA -3' 
497F 5/-TTGTGATGTCGATGTATGC-3’ 
975R 5'-TGCTCCWGCAGCCATCCAGAG-3' 
*1362R 5'-TCWGATATTGGACTAGCWCGT-3" 
*1365R 5'-TTYAARTCHGATATYGGACTRGC-3' 
*1296F 5'-AARTGCTTYGCYGGIACACARAT-3' 
1299F 5'-TGGTTTGCTGGGA CACA AAT-3' 
1299-1F 5'-TGGTTTGCHGGSAMHCARAT-3' 
1299-GF 5'-TGCTTTGCTGGV AV YCARAT-3' 
*1869R 5'-CGAAAYTCMACCATTCCMCCAGG-3' 
1872R 5'-CGAAAYTCMACCATTCCMCC-3' 
1869-1R 5'-CKAAWYTCMACCATTCCMCCAGG-3' 
1354R 5'-CAGAAACTATCCATKICCC-3' 


* Amplification and sequencing primers used in more than 
7596 of the taxa examined. 


the MP analysis, a heuristic search with PAUP* 
(Swofford, 2004) was completed using the following 
strategy: addition sequence random, number of 
replicates 100, tree bisection-reconnection branch 
swapping, MULTREE option on, and MAXTREE 
auto-increased by 100. 

For likelihood analyses, the program MODEL- 
TEST v. 3.6 (Posada & Crandall, 1998) was used to 
select the models of nucleotide evolution. The 
program GARLI v. 0.96 (Genetic Algorithm for 
Rapid Likelihood Inference; Zwickl, 2006) imple- 
menting the general time reversible (GTR) substitu- 
tion model, with the base frequency and proportion of 
invariable sites estimated, was used for the Xdh gene 
data set. 

The parsimony bootstrapping analyses (100 repli- 
cates) were conducted in PAUP* (Swofford, 2004) 
using simple taxon addition, one tree held at each 
step during stepwise addition, tree bisection-recon- 
nection branch swapping, MULTREE option on, and 
no upper limit of MAXTREE set. 


RESULTS 


CLONES 


Among plants, only Arabidopsis has been previ- 
ously reported to have two copies of Xdh, and these 
copies have a high degree of similarity (94%). 
Hesberg et al. (2004) reported that the physicochem- 
ical properties of Arabidopsis AAXDH1 and AtXDH2 
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might be nearly identical because of their high degree 
of similarity (94%). For this study, over 80 clones 
were sequenced representing 20 taxa. Taxa that had 
more than one copy had 91%-99.8% similarity 
among sequences, such as Coriaria sarmentosa G. 
Forst. (sp.) of the Cucurbitaceae having a 9596 
similarity among cloned sequences. Only Tacca J. R. 
Forst. & G. Forst. and Polygala L. had similarities of 
7196 and 5896, respectively, among the cloned 
sequences. In both taxa, once the highly divergent 
sequence was removed, homology among the remain- 
ing sequences was 98% and 99%, respectively. 


COMPARING METHODS OF ANALYSIS 
Sequence variability 


Approximately 1265 bp were sequenced, resulting 
in 1557 aligned characters due to the insertion of 
gaps. Sequences were aligned manually using 
Sequencher 4.7 (Gene Codes Corporation, Ann 
Arbor, Michigan, U.S.A). Eleven indels ranging from 
3 to 66 bp in length were inserted in the alignment. 
Of the aligned characters, 1187 (7696) were variable, 
1068 (69%) were potentially parsimony-informative, 
and approximately 180 (1296) were missing. The 
transition:transversion ratio in Xdh was 1.10, indi- 
cating that a high level of saturation was not reached 
(ratio of 0.4 and below is an indication of highly 
saturated sequences, according to Holmquist [1983 ). 


Phylogenetic results 


Parsimony trees containing 253 taxa produced 
21,585 trees of 31,337 steps in length, with a 
consistency index (CI) — 0.11 and a retention index 
(RI) — 0.50 (940 hours used). Homoplasy levels are 
comparable with those from Soltis et al.’s (2000) 
three-gene data set (CI — 0.12, 567 taxa) and Hilu et 
al.’s (2003) matK gene data set (CI — 0.14, 374 taxa). 

The discussion is based on the likelihood tree 
because of its better resolution and support, and the 
results from the MP topology will be contrasted only 
when relevant. Bootstrap support of 5096—7496 is 
considered low, 75%-84% moderate, and greater 
than 8596 high (Chase et al., 2000). 

The likelihood tree (Figs. 1-8) has a total of 190 of 
the 253 nodes (71%) that received bootstrap support 
of 50% or greater, and support levels averaged 71%. 
The likelihood tree depicts the angiosperms as 
monophyletic (80%) with Ceratophyllum (Ceratophyl- 
laceae) as a sister to the rest of the flowering plants, 
followed successively by Amborella (Amborellaceae), 
a clade incorporating Brasenia Schreb., Nuphar Sm., 
and Nymphaea L. (Nymphaeaceae) and an Austro- 
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baileya C. T. White-Kadsura Juss.—Schisandra 
Michx. (Austrobaileyales) clade as sister to the 
remaining angiosperms. Acorus L. (7396) is sister to 
the remaining monocots (74%). A weakly supported 
magnoliid clade (42%) consists of Piperales and 
Canellales (56%) and Laurales and Magnoliales 
(90%). Acorus plus the remaining monocots, magno- 
liids, and Chloranthaceae sequentially diverge after 
the Austrobaileyales. Eudicots include a basal grade 
of Ranunculales and Proteaceae, Sabiaceae, Trocho- 
dendraceae, Buxaceae, Gunneraceae, and Dillenia- 
ceae and Santalaceae, which form a grade to the 
remaining eudicots. The grade of taxa containing 
Ranunculales and Proteaceae, Trochodendraceae, 
and Gunneraceae to the eudicots received 72%, 
7196, and 72% support, respectively. The remaining 
eudicots are split into two clades. The first clade 
consists of the Ericales, Cornales, and euasterids I 
and II (lamids and campanulids), i.e., the asterid 
clade (Asteridae, Fig. 5). The second clade consists 
of the following orders: Saxifragales, Myrtales— 
Caryophyllales-Cucurbitales, Crossosomatales, Ger- 
aniales, Rosales-Fabales-Fagales, Celastrales, Mal- 
pighiales, Brassicales-Malvales, Oxalidales, and 


Sapindales (Figs. 6—8). 


DISCUSSION 


EARLY-DIVERGING ANGIOSPERMS 


Early-diverging angiosperms have been the subject 
of intensive molecular studies (Chase et al., 1993; 
Soltis et al., 1997, 1999, 2000, 2005; Hoot et al., 
1999; Mathews & Donoghue, 1999, 2000; Parkinson 
et al., 1999; Qiu et al., 1999, 2000, 2005, 2006; 
Barkman et al., 2000; Graham & Olmstead, 2000; 
Savolainen et al., 2000; Zanis et al., 2002, 2003; 
Borsch et al., 2003; Goremykin et al., 2003; Hilu et 
al., 2003; Aoki et al., 2004; Stefanovié et al., 2004; 
Lóhne & Borsch, 2005). Recent studies have found 
the earliest diverging lineages of the angiosperms to 
be members of the Amborellaceae and Nymphaea- 
ceae, either as a clade or grade, followed by the 
Austrobaileyales (Savolainen et al., 2000, atpB; Soltis 
et al., 2000, 2005; Hilu et al., 2003; Qiu et al., 
2006). The Xdh gene had a topology supported by 
81% bootstrap values for the monophyly of the 
angiosperms consisting of Ceratophyllum, followed by 
Amborella, Nymphaeaceae, and Austrobaileyales (Fig 
1). The position of Ceratophyllum has varied widely 
in previous studies: sister to either all angiosperms 
(Chase et al., 1993; Savolainen et al., 2000 [rbcL, 
atpBlrbcL|), sister to monocots (Zanis et al., 2002, 
2003; Qiu et al., 2005; Soltis et al., 2005), sister to 
Chloranthaceae (Qiu et al., 2006), sister to eudicots 
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Figure 1. 
branches are likelihood bootstrap values. 


(Soltis et al., 2000; Hilu et al., 2003; Qiu et al., 2005, 
2006), or sister to Acorus, which is sister to the 
remaining monocots (Savolainen et al., 2000 [atpB |). 
Only one sample of Ceratophyllum was used in our 
study, which may influence the placement of this 
taxon in this analysis. The placement of Amborella 
and Nymphaeales followed by Austrobaileyales is in 
agreement with most cited studies. In one of the most 
intensive studies of the earliest diverging lineages, 
using three mitochondrial, one nuclear, and four 
chloroplast genes (Qiu et al., 2006), different 
inferences among gene data sets were shown. 
Comparison of the mitochondrial and chloroplast 
data sets found that mitochondrial genes supported 
Amborella-Nymphaeales, while chloroplast genes 
supported Amborella as sister to remaining angio- 
sperms. Depending on the data set, Ceratophyllum 
was either sister to Chloranthaceae, with a likelihood 
bootstrap value of 26%, or sister to eudicots, with a 
likelihood bootstrap value ranging from 38% to 51%. 
Only one nuclear gene was examined in the study 
from Qiu et al. (2006), and comparisons were not 
made with the mitochondrial or chloroplast topolo- 
gies. Previous studies, along with the present 
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research, suggest that resolving these relationships 
will require more research. 


CHLORANTHACEAE 


The position of Chloranthaceae has varied in 
previous studies, from sister to all of the euangio- 
sperms (Qiu et al., 2006; Cantino et al., 2007), sister 
to Ceratophyllum (Qiu et al., 2006), sister to 
monocots (Hilu et al., 2003), sister to magnoliids 
and eudicots (Zanis et al., 2002, 2003; Soltis et al., 
2005), sister to magnoliids (Savolainen et al., 2000 
[atpB data set]), or sister to monocots and magnoliids 
(Savolainen et al., 2000 [using the rbcL or atpB/rbcL 
data sets]; Soltis et al., 2000). Most of these 
placements have had very low to only moderate 
bootstrap support. This family exhibits numerous 
distinctive morphological characters and putative 
synapomorphies, and its fossil record can be traced 
back to the Early Cretaceous (Friis et al., 1986; 
Pedersen et al., 1991; Doyle et al., 2003). The Xdh 
likelihood tree found that Chloranthaceae, represent- 
ed here by two genera, forms a strongly supported 
clade (97%) that is sister to all monocots, magnoliids, 
and eudicots (Fig. 1). The most parsimonious tree 
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(MPT) differs in that Chloranthaceae was sister to all 
eudicots. 


ACORUS PLUS REMAINING MONOCOTS 


The Xdh sequences provided moderate support 
(73%) for the monophyly of the monocots, with 
Alismatales (Araceae and Tofieldiaceae) followed by 
Acorus as sister to the remaining monocots (Fig. 1). 
This topology has been found in most single and 
multigene analyses (Chase et al., 2000, 2006; 
Savolainen et al., 2000 [rbcL, atpB/rbcL]; Soltis et 
al., 2000, 2005; Hilu et al., 2003). The monophyly of 


Annals of the 
Missouri Botanical Garden 


Eudicots 
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the Alismatales received high support (10096), along 
with the monophyly of the remaining monocots 
(100%). The internal structure of the monocots is 
represented by two clades. The first clade contains 
Dioscoreales (10096) and Pandanales (Cyclanthaceae 
and Velloziaceae, 88%) with 71% support. The second 
clade includes sister to Asparagales (Xanthorrhoea- 
ceae, 100%), followed by Liliales (Smilacaceae), and 
two nested sister clades, one consisting of Poales 
(Bromeliaceae) and Arecales with weak support (32%) 
and the other of Zingiberales (Marantaceae and 
Strelitziaceae, 100%). These relationships are highly 
congruent with those suggested by Hilu et al. (2003). 
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Figure 4.  Bootstrap consensus tree of likelihood analyses among Ericales. The numbers above the branches are likelihood 
bootstrap values. 


Our results are also consistent with those of Chase et Pandanales are sister to Liliales, while in trees based 
al. (2000, 2006) based on seven genes, 185 rDNA, on Xdh, Smilax L. (Liliales) is sister to the Poales and 
partial 26S, rbcL, atpB, matK, ndhF, and atpl. The Arecales. The monocots are sister to the magnoliids (a 
exact positions of some orders are slightly different; for clade consisting of Piperales, Canellales, Laurales, 
example, in Chase et al. (2006), Dioscoreales and and Magnoliales). 
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Figure 5. Bootstrap consensus tree of likelihood analyses among euasterid I and Il. The numbers above the branches are 


likelihood bootstrap values. 


MAGNOLIIDS 


Recent studies provide strong evidence for a clade 
including only Piperales, Canellales, Laurales, and 
Magnoliales (generally referred to as the magnoliids 
or eumagnoliids). The Xdh topology is in agreement 
with this circumscription, although some earlier 
phylogenetic studies (Chase et al., 1993; Savolainen 
et al., 2000 [rbcL, atpB/rbcL|; Soltis et al., 2000) did 
not recognize the monophyly of the magnoliids or the 
relationships among the four orders (Fig. 2). Qiu et al. 
(1999, 2000, 2006) identified the sister relationships 
between Magnoliales and Laurales and between 
Canellales and Piperales, as well as the monophyly 
of all of the magnoliids. Hilu et al. (2003) also 
recovered the monophyly of the magnoliids, with 
Magnoliales-Laurales and Piperales-Canellales as 
sister groups, but with low levels of support. The Xd 


likelihood tree has high levels of support for the 


Magnoliales-Laurales —sister-group relationship 
(9096), whereas the Canellales—Piperales sister-group 
relationship received lower support (5696). These 
sister-group relationships have been found in several 
other analyses (Mathews & Donoghue, 1999, 2000; 
Qiu et al., 1999, 2000, 2005, 2006; Barkman et al., 
2000; Graham & Olmstead, 2000; Zanis et al., 2002, 
2003; Borsch et al., 2003, 2005; Hilu et al., 2003; 
Lóhne & Borsch, 2005; Soltis et al., 2005; Graham et 
al., 2006). 

In the clade consisting of the Canellales, Winter- 
aceae formed a strongly supported clade (100%) 
with Canella P. Browne (10096). The Piperales 
(7196) contained members of Saururaceae and 
Aristolochiaceae. The position of Asarum L. (Aris- 


tolochiaceae) as sister to the Saururaceae (100%) 
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Figure 6. Bootstrap consensus tree of likelihood analyses among the core eudicots. The numbers above the branches are 


likelihood bootstrap values. 


was different than that in most other analyses. 
However, this position for Asarum was also found in 
the study by Hilu et al. (2003), but, with increased 
(2006) showed that 


Asarum regrouped with members of Aristolochia- 


taxon sampling, Qiu et al. 


ceae. Morphological characters clearly place Asarum 
in Aristolochiaceae, with aristolochic acids, sepals 
connate, filaments slightly adnate to the style, ovary 
inferior, and ovules numerous among their common 
traits (Judd et al., 
may be influencing the placement of Asarum in this 


2007). Limited taxon sampling 


analysis. Increasing the sampling of Aristolochia- 
ceae, Piperaceae, and Saururaceae may result in 
different inferences. 

The Magnoliales formed a weakly supported clade 
(6596), and all families with more than one genus are 


The 


Himantandraceae, 


monophyletic. Annonaceae, Degeneriaceae, 


Eupomaliaceae, Magnoliaceae, 
and Myristicaceae have occupied various positions 
in most major analyses (Qiu et al., 1999, 2000, 2005, 
2006; Zanis et al., 2002; Hilu et al., 2003). Xdh 
sequences provide evidence that Himantandraceae is 
sister to the remaining members of the order, followed 


by Myristicaceae, Degeneriaceae, and Magnoliaceae, 


which are sister to the clade Eupomatiaceae and 
Annonaceae. 

The Laurales formed a moderately supported clade 
(78%) with all multitaxa families remaining mono- 
phyletic, except for the Lauraceae. The nonmono- 
phyly of Lauraceae was also found by Qiu et al. 
(2006) using several taxa, but Chanderbali et al. 
(2001) examined additional taxa and found a clade 
containing Cryptocarya R. Br. to be basal in the 
Lauraceae. The placements of Atherospermataceae, 
Calycanthaceae, Gomortegaceae, Hernandiaceae, 
Lauraceae, Monimiaceae, and Siparunaceae of the 
Laurales have varied in most major analyses, and 
their relationships will remain unresolved until more 
examinations are completed (Qiu et al., 1999, 2000, 
2005, 2006; Chanderbali et al., 2001; Zanis et al., 
2002; Hilu et al., 2003). 

Although the magnoliid grouping has been found 
in many other molecular analyses, sound morpholog- 
ical or phytochemical characters to define the groups 
are lacking (Doyle & Endress, 2000). Soltis et al. 
(2005) cited some chemical compounds such as 
neolignan and P-type sieve-tube plastids for most of 
the families, and Stevens (2001, onward) mentions 
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chemical compounds such as neolignan and embry- 
ological features, but there are no unambiguous 
synapomorphies for this clade. The Xdh analysis has 
provided additional evidence for the monophyly and 
relationships among the magnoliids. 


EUDICOTS 
Early-diverging eudicots 


The early-diverging eudicot taxa—Ranunculales, 
Proteaceae, Sabiaceae, Trochodendraceae, Buxaceae, 
and Gunneraceae—have been consistently recovered 
together as mainly a grade in a number of analyses 
(Chase et al., 1993; Hoot et al., 1999; Savolainen et 
al., 2000 [atpB, atpB/rbcL}; Soltis et al., 2000, 2003, 
2005; Hilu et al., 2003); however, the placement of 
these taxa has varied. Xdh data are mostly in 
agreement with results obtained from previous 
studies, although the Xdh trees are most closely 
similar to those of Soltis et al. (2005). Ranunculales 
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and Proteaceae (72%), Sabiaceae (35%), Trochoden- 
draceae (71%), Buxaceae (38%), and Gunneraceae 
(72%) form a grade in the Xdh likelihood tree (Fig. 
3). Ranunculales and Proteaceae were sister, forming 
a clade with low support (70%). This clade was not 
found by Soltis et al. (2005) and could be a result of 
the Xdh analysis having low sampling of Proteaceae, 
including only a single representative, Platanus L. 
Eschscholzia Cham. (Papaveraceae, 96%) and 
Euptelea Siebold. & Zucc. (Eupteleaceae, 50%) form 
a grade to the remaining Ranunculales, and the 
Ranunculales are sister to the remaining eudicots, 
consistent with other analyses (Chase et al., 1993; 
Soltis et al., 1997, 2000, 2003, 2005; Hoot et al., 
1999). Lardizabalaceae is a strongly supported clade 
(100%) sister to the Berberidaceae and Ranuncula- 
ceae clade (99%). Berberidaceae and Ranunculaceae 
are strongly supported as monophyletic groups (100% 
and 92%, respectively). The placement of Dillenia- 
ceae and Santalaceae, forming a clade that is sister to 
the asterids and rosids, also occurs in the MPTs. The 
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Figure 8. Bootstrap consensus tree of likelihood analyses among eurosid II. 
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sister-group relationship of Dillenia L. (Dilleniaceae) 
and Osyris L. (Santalaceae) (45%) has been found in 
other studies (Soltis et al., 2000, 2005), but the 
relationships of Dillenia are uncertain, with many 
different hypothesized relationships (compare Savo- 
lainen et al., 2000; Soltis et al., 2000, 2003, 2005; 
Hilu et al., 2003). The relationships of the Santala- 
ceae are equally uncertain (compare Savolainen et 


al., 2000; Soltis et al., 2000, 2003; Hilu et al., 2003). 


EUDICOTS CLADE I 


Asterids 


The asterid clade, sensu Olmstead et al. (1992, 
1993) and Judd and Olmstead (2004), is monophy- 
letic (58% bootstrap support) in the Xdh tree (Fig. 5) 
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and consists of the four major lineages Cornales, 
Ericales, and euasterid I and II. In the Xdh tree the 
Ericales—Cornales clade is strongly supported (97%, 
Fig. 4). The euasterids I and II (Fig. 5) formed poorly 
supported sister clades (6396). Although support 
values are difficult to compare, this topology is mostly 
in agreement with other multigene studies (Savolai- 
nen et al., 2000; Soltis et al., 2000). 


Ericales—Cornales 


The monophyly of Ericales received moderate 
support (83%) in the Xdh analysis, while the 
monophyly of the Cornales lacks bootstrap support 
(31%, Fig. 4). The resolution and support for the 
Ericales clade were similar to those in other analyses, 
including Bremer et al.’s (2002) six-gene plastid 
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analysis, Anderberg et al.’s (2002) five-gene analysis 
from plastid and mitochondrial regions, and Schó- 
nenberger et al.’s (2005) ll-gene analysis from 
nuclear, mitochondrial, and chloroplast regions. 
Within the Ericales, the Xdh analysis recovered four 
major clades, with multiple-sampled families mono- 
phyletic: (1) The first clade (100%) contains 
Tetrameristaceae and Balsaminaceae sister to Marc- 
graviaceae. (2) The second clade (100%) consists of 
the Primulaceae. (3) The third clade is tentative due 
to low support (18%) and has Sapotaceae sister to 
Lecythidaceae, which is sister to a Fouquieriaceae— 
Ebenaceae clade. The lack of samples from Polem- 
oniaceae, which typically has been found to be sister 
to Fouquieriaceae (Savolainen et al., 2000 [atpB, 
atpB/rbcL|; Anderberg et al., 2002; Bremer et al., 
2002; Soltis et al., 2005), is problematic in this 
analysis and may explain the low support for this 
group. (4) The fourth clade, also with low support, has 
two major lineages. The first contains Theaceae, 
sister to a clade containing Symplocaceae, and 
Styracaceae, sister to Diapensiaceae. The second is 
weakly supported and contains Sarraceniaceae sister 
to a clade consisting of Pentaphylacaceae, Theaceae, 
Actinidiaceae, Cyrillaceae, Clethraceae, and Erica- 
ceae. Other multigene studies have found Sarrace- 
niaceae sister to Actinidiaceae, together sister to 
Clethraceae, Cyrillaceae, and Ericaceae (Anderberg 
et al., 2002; Bremer et al., 2002). Xdh provides a 
similar inference although with varying support 
values. Using morphological data, Judd and Kron 
(1993) found the same topology for relationships with 
the core Ericales as found in the Xdh analysis. 


Euasterid I (Lamids) 


The euasterid I clade (consisting of Boraginaceae, 
Garryales, Gentianales, Lamiales, and Solanales) has 
strong bootstrap support (88%, Fig. 5) in the Xdh 
analysis, whereas most other studies have shown low 
to moderate support levels for it (Savolainen et al., 
2000 [atpB, rbcL, atpBírbcL|; Soltis et al., 2000, 
2003; Albach et al., 2001; Bremer et al., 2002; Hilu 
et al., 2003). The first branching lineage is the 
Garryales and this is in agreement with previous 
studies (Savolainen et al., 2000 [atpB, rbcL/atpB |; 
Soltis et al., 2000, 2003; Albach et al., 2001). There 
is strong support (98%) for the relationships found 
among Boraginaceae, Gentianales, Solanales, and 
Lamiales. Monophyly of the orders Gentianales 
(100%) and Lamiales (including Borago L., 97%) 
also obtained strong support. This level of support 
has been found in most other studies (Savolainen et 
al., 2000 [atpB, rbcL, atpB/rbcL|; Soltis et al., 2000; 
Albach et al., 2001; Bremer et al., 2002; Hilu et al., 
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2003). The Gentianales has Rubiaceae as sister to 
members of the Loganiaceae, Apocynaceae, and 
Gelsemiaceae. The Lamiales contains representative 
laxa from Bignoniaceae, Boraginaceae, Calceolaria- 
ceae, Gesneriaceae, and Plantaginaceae. In a clade 
representing members from the Plantaginaceae and 
Boraginaceae, Erinus L. (Plantaginaceae) is sister to 
Borago (Boraginaceae). 


Euasterid Il (Campanulids) 


The euasterid II clade is weakly supported (50%) 
and consists of Aquifoliales, Asterales, Bruniaceae, 
Dipsacales, Escalloniaceae, and Apiales (Fig. 5). 
Aquifoliales as the first branching lineage is in 
agreement with most previous studies (Chase et al., 
1993; Savolainen et al., 2000 [atpB, atpB/rbcL |; Soltis 
et al., 2000, 2003, 2005; Albach et al., 2001; Bremer 
et al., 2002) excluding those performed by matK 
analysis (Hilu et al., 2003), in which Aquifoliales 
aligned with the euasterid I (Campanulids) clade. 
Most analyses agree that euasterid Il is composed of 
four major orders (Aquifoliales, Apicales, Dipsacales, 
and Asterales); however, the sister-group relation- 
ships differ among analyses. The most recent 
multigene analysis of Burleigh et al. (2009) is in 
agreement with this study in having Aquifoliales 
basal and sister to Asterales, together sister to 
Apiales and Dipsacales. The Apiales consists of 
approximately 10 families, the Asterales consists of 
12 families, and the Aquifoliales contains approxi- 
mately five families. The Xdh study sampled eight 
families of the possible 27 from these three orders; 
additional samples from the remaining families will 
most likely increase the internal resolution. and 
provide additional support within euasterid II. 


EUDICOTS CLADE II 


The eudicots clade II likelihood tree contains the 
following taxa: Saxifragales, Myrtales—Caryophyl- 
lales-Cucurbitales, Crossosomatales, Geraniales, 
eurosid I (without Cucurbitales and Oxalidales), and 
eurosid IL, with Oxalidales recovered in the Xdh tree. 
The MPT found Saxifragales, Crossosomatales, Ger- 
aniales, eurosid I (with Cucurbitales, Fabales, 
Fagales, Myrtales, and Rosales), and eurosid II (with 
Brassicales, Celastrales, Malphigiales, Malvales, 
Oxalidales, and Sapindales) as groups. 


Saxifragales 


The Xdh analysis has low support (73%) for the 
monophyly of Saxifragales (Fig. 6). Three well- 
supported clades are resolved within this order. The 
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first clade includes Iteaceae sister to Grossulariaceae 
(100%). A similar topology was recovered by Fish- 
bein et al. (2001), Hilu et al. (2003), and Soltis et al. 
(2005). The second clade consists of a monophyletic 
Hamamelidaceae (100%), which has been document- 
ed in many other studies (Savolainen et al., 2000 
|atpB/rbcL}; Soltis et al., 2000, 2003, 2005; Fishbein 
et al, 2001; Hilu et al, 2003). A third clade 
including Altingiaceae, Cercidiphyllaceae, Daphni- 
phyllaceae, Hamamelidaceae, and Paeoniaceae is 
well supported (10096). Several major families are 
missing, such as Saxifragaceae and Crassulaceae. 
Therefore, it is still premature to comment on the 
position of the taxa in the third clade. Soltis et al. 
(2007a) also were unable to recover stable relation- 
ships among the woody Saxifragales. The Saxifragales 
circumscription contains a wide range of morpholog- 
ical diversity that departs from traditional classifica- 
tion. The nonmolecular synapomorphies have not yet 
been found and more research is needed for 
clarification. 


Myrtales—Caryophyllales—Cucurbitales 


The analyses recover a weakly supported (6%) 
clade consisting of Myrtales, Caryophyllales, and 
Cucurbitales (Fig. 6). This inference may be 
misleading or incorrect due to the weak support 
because no other analyses have grouped these orders 
together. In contrast, the MPT is like previous MP 
analyses, revealing Caryophyllales as sister to the 
eudicots, and Cucurbitales as sister to Fagales. In a 
recent analysis, the Caryophyllales clade has been 
variously placed at the base of the asterids (Hilu et 
al., 2003; Soltis et al, 2005), sister to the 
Saxifragales (Soltis et al., 2003), sister to the eudicots 
(Savolainen et al., 2000 [rbcL, atpB/rbcL|; Soltis et al., 
2000), or sister to the asterids and rosids (Savolainen 
et al., 2000 [atpB |). Cucurbitales are typically found 
associated with a nitrogen-fixing clade, which 
includes Rosales, Fagales, Fabales, and Cucurbitales 
(Soltis et al., 1995, 1997, 2000, 2003, 2005; 
Savolainen et al., 2000 [atpB, rbcL, atpB/rbcL|; Hilu 
et al., 2003). The position of the Myrtales is usually 
nested within, or sister to, a clade consisting of 
Malvales, Brassicales, and Sapindales (Soltis et al., 
1997, 2000, 2005; Savolainen et al., 2000 [atpB, 
rbcL, atpB/rbcL|; Hilu et al., 2003). The Myrtales 
clade in the Xdh data set has two members with 
partial sequences that may be creating errors in their 
placement. In all three orders, the families are 
underrepresented; we sampled only two of the 
approximately 13 families of the Myrtales, seven of 
the approximately 29 families of the Caryophyllales, 
and three of the approximately seven families of the 
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Cucurbitales. The differences between the likelihood 
and MP analyses need to be further explored using all 
genes in independent and combined analyses. Both 
factors, partial sequences and underrepresentation of 
families, are most likely contributing to this unusual 
placement of the Myrtales. 

Two families were sampled for the Myrtales (63%), 
Myrtaceae and Penaeaceae. Metrosideros Banks ex 
Gaertn., Myrciaria O. Berg, and Psidium L. formed a 
monophyletic Myrtaceae with low support (63%, Fig. 
6). This order has had various circumscriptions in the 
past, consisting of a number of families, and it would 
be premature to speculate on its composition, as well 
as relationships within the order and with other 
rosids. The monophyly of this order is in agreement 
with the results of Conti et al. (1996) based only on 
rbcL, Savolainen et al. (2000) based on atpB/rbcL, 
Soltis et al. (2000, 2005) based on rbcl/atpB/18S, 
and Hilu et al. (2003) based on matK. 

The Xdh sequence data strongly support the 
circumscription of the Caryophyllales (9596, Fig. 6). 
The monophyly of this order has been indicated in 
previous molecular studies (Savolainen et al., 2000 
[atpB, rbcL, atpB/rbcL|; Soltis et al., 2000, 2003, 
2005; Cuénoud et al., 2002; Hilu et al., 2003). Two 
clades within Caryophyllales are recognized with Xdh 
data. Caryophyllales I (10096) includes Caryophylla- 
ceae and Nyctaginaceae, followed by a well-support- 
ed Portulacaceae grade, with Cactaceae nested within 
(100%). Caryophyllales II (3996) contains a clade of 
the carnivorous families Droseraceae and Nepentha- 
ceae (98%) sister to a clade of Plumbaginaceae and 
Simmondsiaceae (5796). The position of the Sim- 
mondsiaceae has varied, being included in the 
Caryophyllales II clade in some analyses (Savolainen 
et al., 2000 [rbcL/atpB |), while, in others, it has been 
included in the Caryophyllales I clade (Soltis et al., 
2000, 2005; Cuénoud et al., 2002; Hilu et al., 2003). 
One of the most comprehensive studies of the 
Caryophyllales, by Cuénoud et al. (2002), indicates 
that the position of Simmondsia Nutt. lacks strong 
support, demonstrating that additional work is 
needed. Our results show Simmondsiaceae as part 
of the Caryophyllales II clade with strong support. 
Members of Caryophyllales have some non-DNA 
features supporting its basic circumscription, but 
many gaps still remain to be filled (for characters, see 
Nandi et al., 1998; Stevens, 2001, onwards). 

The Curcurbitales were defined by the Angiosperm 
Phylogeny Group (1998) and Angiosperm Phylogeny 
Group II (2003) as containing the following seven 
families: Anisophylleaceae, Begoniaceae, Coriaria- 
ceae, Corynocarpaceae, Cucurbitaceae, Datiscaceae, 
and Tetramelaceae. Begoniaceae, Coriariaceae, and 
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Cucurbitaceae were included in this analysis (Fig. 6), 
which found the Curcurbitales to be monophyletic, 
although this was only weakly supported (72%). 
Other gene studies have also found the Curcurbitales 
to be monophyletic (Savolainen et al., 2000 [atpB, 
rbcL, atpB/rbcL |; Soltis et al., 2000, 2003, 2005; Hilu 
et al., 2003; Zhang et al., 2006). Morphological and 
phytochemical synapomorphies for the Curcurbitales 
are still unclear; even the most recent study by Zhang 
et al. (2006), using 13 molecular markers and 
morphological characters, could not identify a 
synapomorphy for the order. 


Crossosomatales 


The Crossosomatales clade, here consisting of 
Staphyleaceae and Stachyuraceae, is well supported 
(100%, Fig. 6). Both the rbcL/atpB analysis (Savolai- 
nen et al., 2000) and the analysis of 18S, rbcL, and 
atpB (Soltis et al., 2000) contained a well-supported 
Crossosomatales clade. Stachyuraceae has previously 
been placed in the Violales, whereas Staphyleaceae 
has been placed in the Sapindales (Cronquist, 1981). 
Thus, another group of families previously placed in 
distantly related orders come together in a well- 
supported clade. Matthews and Endress (2005) 
investigated the floral characters and revealed 
potential new synapomorphies for a close relationship 
among the core member of Crossosomatales. Among 
the prominent floral features of Crossosomatales are 
solitary flowers, presence of a floral cup, imbricate 
sepals with outermost smaller than inner, pollen 
grains with horizontally extended endoapertures, 
shortly stalked gynoecium, postgenitally united 
carpel tips forming a compitum, stigmatic papillae 
2-cellular or more, ovary locules tapering upward, 
long integuments forming zigzag micropyles, cell 
clusters with bundles of long yellow crystals, 
mucilage cells, seeds with smooth, sclerified testa 
and without differentiated tegmen. However, more 
molecular and morphological work is needed to 
resolve both the composition of this clade and its 
position within the rosids. 


Geraniales 


Two taxa of Geraniales were sampled for the Xdh 
analysis, Geranium L. (Geraniaceae) and Greyia 
Hook. & Harv. (Melianthaceae) (Fig. 6). These two 
families are held together in a weakly supported 
clade (6196). These families were previously placed 
(Cronquist, 1981; Takhtajan, 1997) in different 
orders, and no known morphological synapomorphies 
hold the present order together. Geraniales have been 
found in previous analyses (Savolainen et al., 2000 
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[atpB/rbcL|; Soltis et al., 2000) to be sister to the 
Crossosomatales. While there is no support for that 
sister relationship here, likelihood values at the base 
of the grades leading up to the eurosids are low (12% 
or less) and rearrangements may be expected with 
additional sampling of taxa and inclusion/combina- 
tion with more data sets. 


Eurosid 1 


In the analysis, the nitrogen-fixing clade (including 
Rosales, Fagales, Fabales, and Cucurbitales; Soltis et 
al., 1995), excluding Cucurbitales, is weakly sup- 
ported at the base of the eurosid I grade (20%), 
followed by Celastrales (88%) and Malpighiales (< 
50%) (Fig. 7). Three nitrogen-fixing subclades 
received low to strong support and have no support 
as a clade (28%): Fabales (87%), Fagales (100%), 
and Rosales (65%). These subclades have been 
placed together with the order Cucurbitales in other 
single and multigene analyses (Savolainen et al., 
2000 [atpB, rbcL, atpB/rbcL|; Soltis et al., 2000, 
2003; Hilu et al., 2003). The Fabales consist of well- 
supported Fabaceae (10096) and Quillajaceae. The 
order is sister to the Fagales, which contains a mostly 
well-supported clade of Fagaceae, Betulaceae, Myr- 
icaceae, and Juglandaceae. The Rosales are sister to 
Fabales and Fagales. This order consists of Rosaceae 
sister to the rest of the order followed by Rhamnaceae 
sister to Humulus L. (Cannabaceae), Ulmaceae sister 
to Celtis L. (Cannabaceae), and Moraceae sister to 
Urticaceae. The Cannabaceae are not monophyletic 
in the Xdh analysis because they group with members 
of the Rhamnaceae and Ulmaceae; however, the 
placement of Humulus is based on a partial sequence, 
which may have caused anomalies. Soltis et al. 
(2005) found similar clades including a nonmono- 
phyletic Cannabaceae in the Rosaceae. As mentioned 
earlier, the MP analysis found eurosid I to consist of 
Cucurbitales, Fabales, Fagales, Myrtales, and Ro- 
sales. 

The Xdh analysis supports (10096) the Celastrales 
clade (10096, Fig. 7). Most recent analyses support 
the order Celastrales, including various families 
placed next to the Malpighiales (Zhang & Simmons, 
2006), Oxalidales (Hilu et al., 2003; Soltis et al., 
2005), or the Oxalidales and Malpighiales (Soltis et 
al., 2000). The Xdh likelihood and MP analysis 
showed Celastrales in a well-supported clade sister to 
the Malpighiales and eurosid II clades (88%). 

The Malpighiales form a weakly supported clade 
with three supported subclades: Salicaceae (100%), 
Malpighiaceae (10096), and part of the Euphorbia- 
ceae (73%, Fig. 7). The Euphorbiaceae are not 
monophyletic, as also found in the analyses by Soltis 
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et al. (2000) and Tokuoka and Tobe (2006, four- 
marker study). In the Xdh analysis, Codiaeum 
variegatum (L.) Rumph. ex A. Juss., Euphorbia 
nerüfolia L., and E. pseudocactus A. Berger of the 
Euphorbiaceae formed a clade, whereas Phyllanthus 
flexuosus Müll. Arg. (Euphorbiaceae—now Phyllan- 
thaceae) formed a sister grouping with Passiflora 
suberosa L. (Passifloraceae). This study sampled the 
following seven families: Chrysobalanaceae, Clusia- 
ceae, Euphorbiaceae, Malpighiaceae, Passifloraceae, 
Salicaceae, and Violaceae. Previous analyses have 
found that the Malpighiales encompass at least 28 
morphologically diverse families (Soltis et al., 2000). 
The relationships among these families are poorly 
understood and further studies incorporating addi- 
tional taxa and non-DNA characters are needed to 
resolve relationships within this heterogeneous order. 


Eurosid II (including Oxalidales) 


The Xdh analysis produced a very weakly 
supported (< 50%) eurosid II clade, including 
members of the Brassicales, Malvales, Oxalidales, 
and Sapindales (Fig. 8). Each of these orders was 
recovered in the Xdh tree with strong support: 
Brassicales (91%), Malvales (100%), Oxalidales 
(97%), and Sapindales (100%). The Xdh data 
provided support for a sister-group relationship of 
Brassicales and Malvales (82%). This relationship 
was also obtained in rbcl/aipB analyses but was not 
well supported (Savolainen et al., 2000), although in 
the matK analysis these orders were strongly 
supported (8996, Hilu et al., 2003). In a family-level 
analysis examining glucosinolate biosynthesis taxa 
using rbcL and 18S, this sister-group relationship of 
Brassicales and Malvales was also found (Rodman et 
al.; 1998). A clade consisting of the Sapindales, 
Brassicales, and Malvales has been found in some 
other recent analyses, but typically this association 
has not been well supported (Savolainen et al., 2000 
[rbcL/atpB|; Soltis et al., 2000, 2003; Hilu et al., 
2003). In Soltis et al.’s (2000, 2003) three- and four- 
gene analyses, weak to moderate support was found 
for Sapindales being sister to Malvales, whereas no 
support was found for an association with Brassicales. 

This study included representatives of Brassica- 
ceae, Caricaceae, Resedaceae, and Tropaeolaceae of 
Brassicales (Fig. 8). Arabidopsis and Capparis L. 
(Brassicaceae) formed a well-supported clade (100%) 
sister to Reseda L. (Resedaceae, 100%), whereas 
Tropaeolum L. (Tropaeolaceae) and Carica L. 
(Caricaceae) formed a weakly supported clade 
(54%). Chemical compounds including glucosino- 
lates from phenyalanine and/or tyrosine and myrice- 
tin and other methylated flavonols may provide 
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synapomorphies for the order (Stevens, 2001, 
onward). 

The Malvales comprised a well-supported (100%) 
order consisting of Sterculia L. (Malvaceae), Bixa L. 
(Bixaceae) (86%), Hopea Roxb. (Dipterocarpaceae) 
(100%), and Cistus L. and Helianthemum Mill. 
(Cistaceae) (100%, Fig. 8). It is premature to 
speculate on the familial relationships of Brassicales 
and Malvales using Xdh until additional taxa are 
sequenced. 

The Sapindales are well supported (100%), with a 
number of well-supported subclades: (1) a strongly 
supported (100%) sister group of Anacardiaceae 
(100%) and Burseraceae; (2) a well-supported 
(100%) Sapindaceae with Aesculus L. and Acer L.; 
(3) a well-supported clade (100%) of Meliaceae, 
Simaroubaceae, and Rutaceae; and (4) a well- 
supported Rutaceae (91%), excluding Spathelia L. 
with the Aurantioideae (100%) and Ruta L. basal 
(65%) to the Aurantioideae, as has been previously 
found (Chase et al., 1999). Secondary compounds 
such as the ethereal oils and myricetin provide 
synapomorphies for the order (Morton, unpublished 
data). 

Oxalidales has been positioned as sister to 
Celastrales and Huaceae (< 50%) (Savolainen et 
al., 2000 [atpB/rbcL]), Celastrales (Hilu et al., 2003; 
Soltis et al., 2005), and Malpighiales (Soltis et al., 
2000 |< 50%], 2003 |< 5096]. The inclusion of the 
Oxalidales as sister to the Sapindales (5696) in the 
Xdh analysis is unique but not well supported (56%). 
The MP analysis, however, supports Celastrales and 
Malpighiales sister to Oxalidales, and this clade is 
sister to a Brassicales-Malvales-Sapindales clade. 
The phylogenetic analyses of Soltis et al. (2005) 
indicated that Oxalidales consists of six families: 
Brunelliaceae, Cephalotaceae, Connaraceae, Cuno- 
niaceae, Elaeocarpaceae, and Oxalidaceae. This 
study included a well-supported clade of Cephalota- 
ceae, Cunoniaceae, Elaeocarpaceae, and Oxalidaceae 
(9796). These families have all been previously 
placed in different orders, Cephalotaceae and 
Cunoniaceae in Rosales, Elaeocarpaceae in Malvales, 
and Oxalidaceae in Geraniales (Cronquist, 1981). 
Thus, here the Oxalidales comprise taxa that were 
once considered distantly related. Currently, no 
morphological synapomorphies are evident for euro- 
sid II with or without Oxalidales (Judd et al., 1994, 
2007). 

The rosids remain inadequately understood as to 
supported phylogenetic positions, non-DNA synapo- 
morphies for major clades, and patterns of character 
evolution. Most of these groups contain unexpected 
assemblages of taxa that previously have been placed 
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in distantly related orders. For these new groupings, 
many gaps in the non-DNA information render many 
features phylogenetically uninformative. 


CONCLUSION 


The Xdh tree of 247 genera and 1265 bp produced 
a likelihood tree that is robust and mostly congruent 
with other single or multigene studies (Savolainen et 
al., 2000; Soltis et al., 2000, 2003, 2005; Hilu et al., 
2003; Qiu et al., 2006). The potential utility of Xdh in 
angiosperm phylogenetics can be gauged by the 
number of genera and base pairs used to produce this 
topology in comparison to other single and multigene 
analyses in which substantially more genera and base 
pairs were used to produce similar topologies and 
support levels. The Xdh topology demonstrated 7196 
of the nodes with bootstrap support greater than 5096 
in comparison to the matK topology (374 taxa), with 
79%, and the reported 776-2496 for individual 
analyses of rbcL, atpB (357 taxa), and 18S (223 taxa) 
ribosomal DNA clades (Hilu et al., 2003: table 2, 
1761). The number of nodes receiving greater than 
5096 support in the Xdh analysis was in the same 
range as was obtained for the matK and the combined 
analyses of three to four genes from two genomic 
regions. The number of parsimony-informative sites 
(1068—1069) was similar to the matK analysis (1083— 
1062). 

Xdh analysis provides good support values for 
monophyly of the Caryophyllales, Ericales, and 
Cornales, euasterids | (lamids) Magnoliales and 
Laurales, Malvales, Rutaceae, Oxalidales, Brassi- 
cales, and Sapindales. 

As stated earlier, paralogy exists for all genes. 
More than one copy was found in selected taxa; 
however, most had high sequence similarity, sug- 
gesting recent duplication. Among two taxa that had 
lower values, the highly divergent sequence was 
easily recognized. When the divergent sequence was 
removed, the remaining sequences had 9896-9996 
homology. Having more than one gene phylogeny to 
compare the angiosperm phylogeny is a good 
indicator of paralogy. The likelihood and MP topology 
of the Xdh gene are comparable to other individual 
gene data sets. The potential utility of this gene is 
quite promising at the ordinal, familial, and generic 
levels. 

Although phylogenetic relationships of angio- 
sperms have been explored using a number of gene 
sequences from the chloroplast and mitochondria, the 
nucleus remains poorly explored. Phylogenetics has 
achieved some understanding of angiosperm rela- 
tionships, and the present study contributes to our 
knowledge. There are still parts of the tree that 
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remain unresolved or unsupported by both molecular 
and morphological data. As Chase et al. (2006) 
discovered, adding mitochondrial and nuclear genes 
to four plastid genes produced a highly congruent 
result that is an improvement over their predecessors. 
We must continue to combine chloroplast, mitochon- 
drial, nuclear, and morphological data to improve our 
understanding of angiosperm evolution. 
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